Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3323 | 326 | 1,5 |
4841 | 215 | 2,5 |
5285 | 194 | 1,2 |
5769 | 175 | 1,6 |
6524 | 152 | 3,5 |
6863 | 144 | 1,3 |
7087 | 139 | 1,7 |
7130 | 138 | 1,1 |
7816 | 125 | 1,4 |
7928 | 123 | 1,8 |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
7644 | 128 | 100% |
7874 | 124 | 50% |
7976 | 122 | 10% |
8790 | 109 | 20% |
8791 | 109 | 30% |
10175 | 93 | 2% |
11245 | 83 | 80% |
11729 | 79 | 5% |
13363 | 68 | 40% |
13364 | 68 | 70% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
6249 | 160 | S&P |
28349 | 28 | Pulp&Paper |
45765 | 15 | H&M |
48160 | 14 | H&Mi |
67818 | 9 | T&A |
81244 | 7 | Pulp&Paperi |
88495 | 6 | AT&T |
90920 | 6 | S&P500 |
100613 | 5 | C&R |
116478 | 4 | Anne&Stiil |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
46 | 14451 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
10679 | 88 | Google'i |
12159 | 76 | Neuville'i |
13713 | 66 | Apple'i |
14657 | 61 | France'i |
14874 | 60 | Giro d'Italia |
14928 | 60 | d'Italia |
16444 | 54 | Youtube'i |
18158 | 48 | France'il |
20344 | 42 | League'i |
20803 | 41 | Ypres'i |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
2770 | 394 | 2+2 |
21109 | 40 | 2+1 |
57291 | 11 | 1+1 |
99759 | 5 | 0+0 |
99760 | 5 | 0+1 |
115952 | 4 | 3+1 |
116045 | 4 | 5+1 |
138929 | 3 | 0+8 |
139283 | 3 | 17+1 |
139537 | 3 | 3+10 |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
80164 | 7 | I Wear* Experiment |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
3747 | 286 | Kalev/Cramo |
5248 | 196 | km/h |
8612 | 112 | ja/või |
8896 | 108 | m/s |
14065 | 64 | Kalev/TLÜ |
17223 | 51 | 24/7 |
17558 | 50 | Kehra/Horizon |
18141 | 48 | 2/3 |
23547 | 35 | 1/2 |
23754 | 35 | https://www |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters occur with more than a very low frequency, this may indicate a problem in preprocessing.
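The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the set of allowed characters, and the frequency threshold are assumptions chosen for the example, not part of the original tooling. The sample entries are copied from the tables in this section.

```python
# Special characters examined in this subsection (from the list in the text).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")

def flag_suspicious(wordlist, allowed, min_freq):
    """Return (word, freq, offending_chars) for every word that contains a
    special character outside `allowed` and occurs at least `min_freq` times."""
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for word, freq in wordlist:
        hits = sorted(set(word) & forbidden)
        if hits and freq >= min_freq:
            flagged.append((word, freq, "".join(hits)))
    return flagged

# A few (word, frequency) pairs taken from the tables in this section.
sample = [("1,5", 326), ("S&P", 160), ('."', 14451), ("Google'i", 88), ("0+8", 3)]

# Hypothetical policy: apostrophe and ampersand are acceptable inside words,
# and anything seen fewer than 10 times is ignored as noise.
result = flag_suspicious(sample, allowed="'&", min_freq=10)
for word, freq, chars in result:
    print(word, freq, chars)
```

Which characters count as allowed, and what counts as "very low frequency", depends on the language and the corpus size, so both are left as parameters here.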
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
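The same query can be generalised to the other characters in the list. The following is a minimal sketch using Python's `sqlite3` module with an in-memory database. The column names `w_id`, `word` and `freq` come from the query above, while the helper `words_containing`, the table layout, and the added `ORDER BY` are assumptions made for the example. The original query appears to rely on the backslash being the default `LIKE` escape character (as in MySQL); SQLite needs an explicit `ESCAPE` clause, so one is added here.

```python
import sqlite3

def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words that contain `char`."""
    # Escape LIKE wildcards so that % and _ are matched literally.
    escaped = char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    pattern = "%" + escaped + "%"
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY added for a deterministic result
    )
    return conn.execute(sql, (pattern, limit)).fetchall()

# In-memory stand-in for the words table (layout assumed for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
# Rows taken from the tables in this section, with w_id = rank + 100.
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(7744, "100%", 128), (7974, "50%", 124), (6349, "S&P", 160), (5348, "km/h", 196)],
)

percent = words_containing(conn, "%")
slash = words_containing(conn, "/")
underscore = words_containing(conn, "_")
print(percent, slash, underscore)
```

Without the escaping step, a pattern built from `_` would match any single character and return every word, which would hide the fact that no word in the sample actually contains an underscore.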
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots